SUPERBOOLEAN RANK AND THE SIZE OF THE LARGEST 
TRIANGULAR SUBMATRIX OF A RANDOM MATRIX 



ZUR IZHAKIAN, SVANTE JANSON, AND JOHN RHODES 

Abstract. We explore the size of the largest (permuted) triangular submatrix of 
a random matrix, and more precisely its asymptotical behavior as the size of the 
ambient matrix tends to infinity. The importance of such permuted triangular sub- 
matrices arises when dealing with certain combinatorial algebraic settings in which 
these submatrices determine the rank of the ambient matrix, and thus attract a special 
attention. 



1. Introduction 

Let X = {xij)fj = i be a random n x n matrix. We assume that the entries of X are 
taken from some set A and that they are independent and identically distributed, with 
F(xij = a) = p a for some fixed probabilities p a , a £ A. We assume further that 0, 1 G A 
and po,pi > 0. 

The purpose of the present paper is to study the size of the largest triangular subma- 
trix of X, and more precisely its asymptotical behavior as n — > oo. We actually consider 
four versions of this problem; it turns out that to the first order studied here, they all 
have the same answer. 

Definitions 1.1. (i) A submatrix of a matrix A = {ctij)i^M jeN is any matrix obtained 
by deleting rows and/or columns of A. In other words, it is a matrix (ajj)i<=/j<=j for a 
non-empty set of rows KM and a non-empty set of columns JCJV. (We preserve 
the order of the rows and columns in / and J.) 

(ii) A permutation of a matrix is a matrix obtained by a permutation of the rows and 
a (possibly different) permutation of the columns. In particular, a permuted submatrix 
of f ajj) is (p'i r j a ) r ' s= i for a sequence of distinct rows ii, . . . , and a sequence of distinct 
columns j 1 ,...,j e . 

(hi) A (lower) triangular matrix is a square matrix (a^)™ =1 such that a%j = 
when i < j. 

(iv) A special triangular matrix is a square matrix (^ij)^j'— i such that Oij — when 
i < j and a™ = 1 when i = j. (The remaining entries are arbitrary.) 
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Note that a k x I submatrix is determined by two sets I, J of indices with |/| = k, 
\J\ = £, while a permuted submatrix is determined by two sequences and 
ji, ■ ■ ■ ,je of indices, with each sequence without repetitions. 

We define the random variable T n as the maximal size (= number of rows, or columns) 
of a submatrix of X that is triangular; similarly T®, Tfi, T„ s are the maximal sizes of a 
submatrices that are special triangular, permuted triangular and permuted special trian- 
gular, respectively. Equivalently, Tn [Tn S ] is the maximal size of a permuted submatrix 
of X that is [special] triangular. Note that 

T s n <T n <T* and T s n <T^<T^ (1.1) 

The general motivation for studying these quantities comes from boolean algebra or, 
more generally, from (tropical) max-plus algebra [l[ and supertropical algebra [j| . These 
algebras take place over semirings and are fundamentally connected to graph theory, 
in particular matrices over these semirings correspond uniquely to weighted directed 
graphs. With this correspondence, basic algebraic notions are naturally substituted 
by combinatorial ones; for example, the role of the determinant is replaced by the 
permanent. These combinatorial analogous also help to bypass the lack of negation in 
the ground semirings. As a consequence, computational complexity, such as computing 
the rank of a matrix, is not always polynomial and could be NP-complete [l(| over this 
framework. 

The specific motivation occurs if one considers either the boolean case (A = {0, 1}) or 
the superboolean case (A = {0, 1,1"}), the simplest example for a supertropical semiring 
[H, 0]. These papers lead to a new algebraic theory of combinatorics by representing 
matroids by boolean matrices. In this theory, a square matrix is non-singular if and 
only if it is permuted special triangular, and the rank of a matrix is thus the maximal 
size of a permuted special triangular submatrix, see Izhakian and Rhodes [H, 0, 0] for 
details. Consequently, the rank of the random matrix X is T« s . 

Theorem 1.2. Let Q = l/p > 1, and let T* be any of T n ,T%,T%,T% B . Then, 
as n — > oo, 

T n */log Q n^2 + V2, (1.2) 

where — denotes convergence in probability. 

We say that an event occurs with high probability (w.h.p.) if its probability tends to 
1 as n — > oo. Recall that, by the definition of convergence in probability, (jl.2p says that 
for any e > 0, w.h.p. 

(2 + V2-E) log Q n < T* < (2 + V2 + e) log Q n. (1.3) 

By (ll.ip . it suffices to prove the upper inequality for Tn and the lower for T®. The 
upper inequality is proved in Section [2] and the lower in Section El the proofs are based 
on the first and second moment methods. (See e.g. 0, p. 54] for a general description 
of these methods.) 

Remark 1.3. The corresponding problem of the largest square submatrix with only 0's 
(or, equivalently, after interchange of and 1, with only l's) has been studied by several 
authors, see (l2l | and the references therein. It is shown in (jjj that if S n is the size of 
the largest such matrix, then S n j 'log^n 2. This problem can be seen as finding 
the largest balanced complete subgraph of a random bipartite graph; the analogous 
problem of finding the largest complete set in a random graph G(n,p) (or, equivalently, 
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the largest independent set in G(n, 1 — p)) was solved by Bollobas and Erdos [3| and 
Matula [lH, see also 0] and [§]; again the size, C n say, is asymptotically 21ogn n, where 

Q = 1 

Note that T n > S n > \T%/2\ > \T n /2\, which shows that T n , T£ and S n are equal 
within a factor of 2 + o(l), and in particular of the same order of magnitude. However, 
it does not seem possible to get the right constant in front of logo n f° r one °f these 
problems from the other. 

For the largest square zero submatrix and the largest cliques in G(n,p), much more 
precise estimates are known, see [l2j and @)HJ> f° r example, it follows that if 

s(n) = 2 log Q n - 2 log Q log Q n + 2 log Q (e/2) , 

then for any e > 0, [s(n)-e\ < S n < [s(n)+e\ and [s(n) + l-e\ <C n < [s(n) + l + e\ 
w.h.p. (and, in fact, almost surely); in particular the sizes are concentrated on one or 
at most two values. It would be interesting to find similar sharper versions of the result 
above, which leads to the following open problems. 

Problem 1.4. Find second order terms for T n , T^,Tn ,Tn S , and if possible even sharper 
estimates, and see if they differ between the four versions. In particular, what are the 
orders of the differences T% — T n , T n — T%, ■ ■ ■ ? 

Problem 1.5. Are the quantities T n ,T^,Tn ,Tn S concentrated on at most two values 
each? 

Problem 1.6. Prove a version of Theorem 11.21 (or a stronger result) with convergence 
almost surely instead of just in probability, seeing X n as submatrices of an infinite 
random matrix in the natural way. 

Problem 1.7. Find corresponding results when po and p\ depend on n. The case when 
Po tends to 1 (not too fast) seems to be the most interesting. 

Remark 1.8. We consider for simplicity only square matrices X, but the definitions 
extend to general mx n matrices. Since the quantities T n ,T^,Tn ,T% S are monotone if 
we add rows or columns, the result of Theorem 11.21 holds as long as log mj log n — > 1; 
this includes for example the case m/n — > c G (0,oo). We have not investigated other 
cases such as m = n 1 for some 7 > 0. 

1.1. Notation. We let [^J an d \x\ denote the largest and smallest integers such that 
[x\ < x < \x~\ . We write [m, n] for the interval {m, m + 1, . . . , n} of integers between m 
and n. Further, log denotes the natural logaritm log e ; recall that logg n = log nj log Q. 

2. Proof of upper bound 

As said above, it suffices to show that T% < (2 + ^/2 + e) logg n w.h.p. for every 
e > 0. We will use the first moment method, i.e., show that a suitable expectation 
tends to 0. However, for reasons discussed below, we will not obtain the right constant 
by calculating the expected number of (permuted) triangular submatrices of X. Instead 
we consider the following type of submatrices. 

Definition 2.1. Let 1 < I < k. A (k, £)-corner matrix is an £ x I matrix (ajj)| J=1 such 
that 

Oij = if i<j + k- £; (2.1) 
if further a™ = 1 when i = j + k — £, the matrix is a special (k, t)-corner matrix. 
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Thus the £ x £ submatrix in the upper right corner of a [special] lower k x k triangular 
matrix is a [special] (A;, £)-corner matrix, and conversely. Note that if £ < k/2, then 
a (k, £)-corner matrix is 0, and if £ = k then a (k, £)-corner matrix is the same as a 
triangular matrix. 

Let vo(k,£) be the number of entries required to be by (|2.ip . Thus vo(k,£) = I 2 
when £ < k/2; for £ > k/2 we have 

l 2l-k t 

v (k,£) = Y,™x(j + k-£-l,£)=^2(j + k-£-l)+ 1 

j=l j=l j=2i-k+l 

= m-km-k + i) + {n _ m _ l _ i)+(h _ l)t 

4fcl -k 2 -2£ 2 + k-2£ 
- 2 • (2-2) 

Similarly, let isi(k,£) be the number of entries required to be 1 in a special (fc,£)-corner 
matrix. Thus vi(k,£) = when £ < k/2 and v x {k,l) = 2£-k when £ > k/2. Further, let 
is(k,£) = uo(k,£) + z^i be the total number of fixed entries in a special (fc, £)-corner 
matrix. If £ > k/2, then by (JZ2D 

V ( k ,e) = ikt - k2 -f- k + 2t . (2.3) 

Let 1 < £ < m and let Y m ,t be the number of permuted (m,£)-corner submatrices 
in X. Note that if X contains a permuted triangular mxm submatrix A, then a suitable 
submatrix of A is a permuted (m,£)-corner submatrix of X. Hence, if Tfi > m, then 
Y m .e > L and Markov's inequality yields 

P(TP > m) < F(Y m/ > 1) < E7 V . (2.4) 

The expected value E Y m ^ is easily computed. The number of permuted £ x £ submatrices 
of X is {n)i ■ (n)g, where {n)i = n(n — 1) • • • (n — I + 1), and for each such matrix, the 

probability that it is triangular is p^ m ^\ with vq{iti,£) given above. Thus, 

= (n)j ■ p u o{m/) < exp(2£logn - logQ • v Q {m,£)). (2.5) 

Taking m = [slogn] and £ = \tlogn\ for some fixed s and t with s/2 < t < s, we have 
by (J23D and (|23|) . 

E7 m/ < exp(2t(logn) 2 -logQ- (2st - s 2 /2 - i 2 )(log n) 2 + 0(log n)) . (2.6) 

We see from (|2.6p that if we choose s and t such that s/2 < t < s and 

2t-logQ- (2st - s 2 /2 - t 2 ) < 0, (2.7) 

then E Y m/ -> and thus by (pill) 

P(T n P > s log n) = P(T n P > m) < E F m ^ -> 0; (2.8) 

hence T% < slogn w.h.p. 

Write for convenience 7 = 1/logQ. The left hand side of (|2.7p is, for fixed s, maxi- 
mized when t = s — 7, and then its value is, by a short calculation, 

s 2 s 2 - 4s7 + 2 7 2 (s - 2 7 ) 2 - 2 7 2 



2s -7 



27 27 27 
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which is negative for s > 27 + \/27- Consequently, taking any s > (2 + \/2)7 and 
t = s — 7, which clearly satisfies s/2 < t < s, (|2.8]1 yields T% < s log n w.h.p. It remains 
only to note that 7 log n = log nj log Q = logQ n. 

Remark 2.2. If we instead estimate the number of (permuted) triangular submatrices, 
we are taking £ = m and t = s in the calculations above and we only obtain the weaker 
estimate T% < (4 + e) logQ n w.h.p. The reason that the first moment method does not 
yield a sharp estimate in this case is that triangular submatrices of large size tend to 
occur in large clusters; thus the expected number of such submatrices of a given size 
can tend to infinity although the probability that the number is nonzero tends to 0. 
See also the proof of the lower bound in Section [3l which shows that a (fc, ^-corner 
matrix of close to maximal size w.h.p. can be extended to a triangular submatrix in 
many different ways. 

3. Proof of lower bound 

We begin by stating three lemmas; the first is elementary and the two others contain 
the main probabilistic arguments. The proofs are provided later. 

Lemma 3.1. Suppose that k\ > £\ > 1, fa > £2 > 1> and 2(£\ — £2) > fei — &2 > 0. 
Then every special (ki,£±) -corner matrix contains a special (k2, £2) -corner submatrix. 

Lemma 3.2. Let e > 0. There exists some k = k(n) and £ = £(n) with 

(2 + V2-e) log Q n < k < (2 + V2) log Q n, 

and 

(1 + V2-E) log Q n < £ < (1 + V2) logg n 
such that w.h.p. X contains a special {k,£)-corner submatrix. 

Lemma 3.3. Let X' be the submatrix (a^)j> n /2, j<n/2 comprising the lower left quarter 
of X . Let e > and let k = k{n) and £ = £{n) be such that k/2 < £ < k and 
k — £ < (1 — e) loggn. If X' contains a special (k, £)-corner submatrix, then w.h.p. X 
contains a special triangular k x k submatrix, and thus T^> k. 

Proof of lower bound in Theorem \1.SX Let < e < 1/3. Let X' be the lower left quarter 
of X as in Lemma 13.31 By Lemma 13.21 there exists k\ and £\ with 

(2 + V2 - e) logQ \n/2\ <k l <{2 + V 7 2) logQ \n/2\ , 
(1 + V2 - e) logQ [n/2\ < h < (1 + yfe) log Q [n/2\ 

such that there w.h.p. is a special (k±, £i)-corner submatrix Mi of X' . 

Note that k\ — l\ < (1 + e) logQ n. Let d = \2e logQ n] , k = k\ — 2d, and I = t\ — d. 
By Lemma 13. 11 there is a special (fc,£)-corner submatrix M2 of M±. It is easily verified 
that k and £ satisfy the conditions of Lemma f3.3l and thus Lemma f3.3l shows that w.h.p. 

T s n > k> (2 + ^2 - 5e) logQ n + 0(1). 

The bound > (2 + y/2-e) log Q n w.h.p. follows by replacing e by e/6. This completes 
the proof of Theorem O since T* > T* by (JTTJJ). □ 

It remains to prove the lemmas. 
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Proof of Lemma \3.1{ Let A be a special (fe, £)-corner matrix. The submatrix obtained 
by deleting the first row and last column is a special (k—2, £— l)-corner matrix. Similarly, 
we obtain a special (k — l,£— l)-corner matrix by deleting the last row and last column, 
and a special (k,£ — l)-corner matrix by deleting the last row and first column. 

The lemma now follows by induction on l\ — £2- □ 

Proof of Lemma \3.2\ We may assume that e < 1/4. We consider a block version of 
(k, ^)-corner matrices. 

Let N be a large integer and let K = \(2 + y/2 - e)N] and L = [(1 + y/2 - e)N] = 
K — N; note that K > L > K/2. Let n\ = \n/L\ and divide the interval [1, n] into 
the L subintervals Ei = \{i — l)n\ + 1, £m], i = 1, . . . , L, ignoring the possible remainder 
at the end. Let X{j be the n\ x m submatrix (x rs ) r ^Ei,seE- of X. 

Let 

9 =rJV- 1 log Q nl (3.1) 

and consider the submatrices of X obtained by choosing q rows from each Ei and q 
columns from each Ej, i,j = 1, . . . , L. We denote the set of all such submatrices by M; 
each M £ Ai is identified by its set of rows and columns, and the number of them is 
thus 

\M\=^j . (3.2) 

Each M is a Lq x Lq submatrix of X which consists of I? blocks My, i,j £ {1, . . . , L}, 
where My is a q x q submatrix of Xij . 

We say that the submatrix M G A4 is good (for a given realization of the random 
matrix X) if M y - = when % < j + K — L and Mij = I (the gxg identity matrix) when 
i = j + K — L; otherwise M is called bad. Thus, a good submatrix can be seen as a 
special (K, L)-corner matrix of q x q matrices. 

Note that a good submatrix M is a special (i^ 9 , Lg)-corner matrix, and that k = Kq 
and £ = Lq satisfy the inequalities in the lemma if N and q are large enough. Hence 
it suffices to show that if iV is large enough, then there exists w.h.p. at least one good 
submatrix M G M. 

Let Lm be the indicator that M is good, i.e., Lm = 1 if M is good and Lm = if M 
is bad, and let Z = Yl<MeM De ^ ne number of good submatrices M G M.. Our task 
is to show that Z > 1 w.h.p., which we do by estimating the mean and variance. 

In order for M to be good, the number of submatrices required to be is vq{K, L), 
and the number required to be L is u\{K,L). Consequently, the number of entries 
required to be is u (K,L)q 2 + ui(K,L)(q 2 — q) = v(K,L)q 2 - vi(K,L)q and the 
number of entries required to be 1 is i>i(K,L)q. Hence, denoting the probability that 
M is good by tt, for each M G A4, 

VT = P(/ M = 1) = p HK,L) q ^{K,L)y^K,L) g _ ^ 

We have by recalling K = L + N, 

4(L + N)L - (L + N) 2 - 2L 2 + 0(N) 



v(K, L) 



2 

L? + 2LN -N 2 
o + 0(N) 



2 

(2 + 2^2 - (2 + y/2)e + £ —^jN 2 + O(N) < (2 - e/2)LN, (3.4) 
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provided N is chosen large enough. We fix such an N; thus K and L are now fixed, 
while n — >• oo. By (|3.ip . 

log n = log Q n • log Q = Nq log Q + 0(1). (3.5) 

Furthermore, (|3.ip also yields, as n — > oo, g < logg n <C m. Hence, by Stirling's 
formula, 

log = qlogni +°(J^) ~ lo g(? ! ) = qlogn + O(qlogq). 

Consequently, by (j3T2]h (^3|) . (J33D and (j33|) . 

EZ = |-M|P(/ M = 1) = |7W|tt 

= exp (2L(q log n + 0{q log g)) - u(K, L)q 2 log Q + 0(qf 

> exp^L^logn) - (2 - e/2)LNq 2 logQ + 0(<? log q) 

= exp({eLNlogQ/2)q 2 + O(glogg)) oo. (3.6) 

To estimate the variance Var(Z), we first calculate the covariance Cov(Im, Im 1 ) = 
K(ImIm') — ^(Im)^(Im') for two submatrices M,M' G .M. Let be the number of 
common rows in Ei of M and M', and let bj be the number of common columns in Ej. 
Then has a^bj entries in common with M[-, so their union has 2q 2 — a^bj elements. 

For i < j + K — L, we have 

P(M« = = M,) V- a<6j _ a . 6 . 
- — = p a! \ (3.7) 



F(M ij = 0)¥(M( i = 0) ~ J/ 



For i = j + K — L, we want Mjj = M-j = I, so we have to consider also the required 
positions of the l's in My and M^-. In many cases, the rows and columns chosen 
for Mij and M[- are such that the conditions M; L j = I and M'- = I are contradictory, 
so P(Mij = M[- = I) = 0. Otherwise, the a^bj common entries of Mij and M[- contain 
some number of entries, d say, that have to be 1 in both and M-j, while the 
remaining a^bj — d have to be in both, and then 

¥{Mjj = Mjj = I) _ {a . b ._ d) d _ b . /p \ d . ( , 

nM t3 = i)nM , ij = i) ~ Po Pi ~ Po \p~J ' 1 j 

note that < d < min(aj, bj). Combining (13. 7b and (|3.8p by taking the product over 
all pairs with i < j + K — L, and recalling that K — L = N, we obtain the upper 
bound 

Wm = i)p(/m' = i) ~ Po ma Hl P1 J (3 - 9) 

Let 7T = P(Iju = 1), C\ = max{(po / Pi) L , 1} and, for a given pair M, M', A = J2i a i an d 
B = Y2j bj, be the numbers of common rows and columns, respectively, of M and M' . 
Then ([3Jl yields 

Cov(ImJm') < P(/a/ = hi' = 1) < Q^^+^^CiV. (3.10) 

Let 

r = r((a i ),(6 i )) = E a *^' C 3 ' 11 ) 

j,j':i<j'+A'' 
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and let t(A, B) be the maximum of r for given sums A = en and B = ^ • bj, with 
Oj, bj G [0, g]. If i\ < 12 and we increase by some A to + A and decrease Oj 2 by 
the same A to a« 2 — A, then r = ^ j. i< j +N aibj cannot decrease. The same happens if 
we decrease bj 1 and increase bj 2 with j\ < Consequently, given A and B, the sum r 
is maximized when, for some indices G 

ai = q when i < i*, a, = when i > i*; (3-12) 

6j = when j < j*, 6j = g when j > j*. (3.13) 

Returning to (13TTU1) . we have the estimate Cov(/ M ,I M ') < Q r(A ' jB) Cj 4 7r 2 . If A = or 
if = 0, then M and M' are disjoint submatrices of X, and thus independent, so in 
this case Cov(Im > Ija') = 0- Consequently, 

Var(Z)= Cov(Im,Im')< ]T Q T ^Cf it 2 , (3.14) 

M,M' M,M':A,B>0 

where A and -B are defined as above, given M and M'. 

For a given M £ Ai, the number of submatrices M' G .M with given ai,...,a^, 
bi,...,b L is 

-«^-n(0»n(0(:-v)- 

We have, for any a G [0, q], 

\J\q-a) ^ 1 \q-a) r, ~l r Q ^ 



(7) " (7) to n *-* v "<- / v "> 



J- J- rj 1 — 7. \ T7.1 / \ T7.1 / 



Thus, recalling (I3.2|) . 

^((Qj)i,(fej);g) /gV+ B 

|M| ~ \nj 

Moreover, given A and B, the number of choices of a±, . . . , cll with sum ,4 is < (A+l) L < 
2 AL , and similarly the number of 61,... ,&£ is < 2 BL . Hence, for each M G Ai, the 
number of M' with given A and is at most, using (|3.15p . 

. T dt/q 2 \A+B /2 L q 2 \A+B /CoQ 2 \A+B 

2 AL 2 BLfQ_\ M= (±J_\ M \<(b2L.) \ M \, 

\niJ \ n\ / V n / 

where C2 = {L + 1)2 L (for n large enough). Since M can be chosen in \Ai\ ways, and 
A,B < Lg, (I3T4D yields, recalling E2= |A^|vr, 

Var(Z)< £ |^|(^!) A+B |^|Q^) C A+^ 7r 2 
,4,5=1 n 

= (EZ) a £ (^) A+B Q r(A ' B) , (3.16) 
with C 3 = CiC 2 . We write (1XTBD as Var(Z) = (EZ) 2 J2a,b K a ,B), with 

X [A,B) = (^f) A+B Q^ B \ (3.17) 
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Claim. If A,B € then X(A,B) < max{A(l, 1), X(Lq, Lq)}; in other words, 

X(A, B) attains its maximum for A = B = 1 or A = B = Lq. 

To prove the claim, let (aj) and (bj) be vectors that maximize r in (|3.1ip for some 
given A and B; we may thus assume that (|3.12p and (I3.13D hold. We first note that if 
A < Nq, then by (13. 12ft we have i* < N and ai = when i > N; hence 

L 

t(A, B) = t = ^2 a-ibj = ^2 a i b i = AB 

i,j:i<j+N i,3=l 

and thus 

X(A,B) = (C 3 q 2 /n) A+B Q AB . 

Keeping A fixed, this is maximized by either B = 1 or B = Lq. 

On the other hand, if A > Nq, then (I3.12h yields a% = q when i < N. Hence, 
increasing any bj by 1 will increase r in (|3.1ip by Yli-i<j+N a « — Nq, and thus r(A, B + 
1) > t(A, B) + Nq. Consequently, by (I3TT71) and (IQl T 

X{A ;f\ 1] = (9^) Q r(AB + l)-r(A,B) > f^W, > C f > X 

X(A,B) v n ) v n ) 

and thus X(A, B) < X(A, Lq) for any B < Lq. 

Hence, for any fixed A < Lq, X(A, B) is maximized by either B = 1 or B = Lq. By 
symmetry, for fixed B, the maximum is attained for A = 1 or A = Lq. Consequently, 
the maximum for all A,B£ [1, Lq] is attained for A, B G {1, Lq}. Moreover, A(l, Lq) = 
X(Lq,l) by symmetry and X(Lq,l) < X(Lq,Lq) by the case A > Nq above, and the 
claim follows. 

We calculate easily the two extreme cases. For A = B = 1, r(l, 1) = 1 and 

A(M) . ( ^)Vo(!^). (, 18) 

For A = B = Lq, all a% = bj = q, and thus r(Lq,Lq) = v(K,L)q 2 . Hence, recalling 
q = O(logn), (JES) and dOjl . 

.2 



7? 



A(Lg, L<7) = ( r«_ ) 



= exp (^-2Lq log n + 0(q log g) + v(K, L)q 2 log Q 
= exp((-2LiVg 2 + V [K, L)q 2 ) log Q + 0(glog g)) 
< exp(-(eLiVlogQ/2)g 2 + 0(glogg)Y (3.19) 



For large n, this is less than exp(— 2iVg) < n 2 . Consequently, the claim and (|3.18p - 
(f3TT9|) shows that for all A,B< Lq, 

X(A,B) = 0( 1 -^). (3.20) 



Finally, by ([515]) and (f3T20l) . 

^t^-O^-Opg)-*), (3,!, 



v ; A,B=1 
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as n — > oo. This is what we need: by Chebyshev's inequality 

hence (|3.2ip yields ¥(Z = 0) — > 0, and thus Z > 1 w.h.p., which completes the proof. □ 

Proof of Lemma \3.3l Condition on X' and fix a special (k, £)-corner submatrix M 1 = 
(Xi> rt j> s )f s =i °f tnus n / 2 < i[ < ■ ■ ■ < i'i < n and 1 < j{ < ■ ■ ■ < j' t < n/2. 
We try to complete M' to a k x k special triangular matrix by adding k — i rows 
«!<•••< ik~e < n /2 and k — I columns n/2 < ji < • • • < ik-t < ft>; we do this by 
trying the rows one by one until we find first a suitable i\ (i.e., one with x^ji = 1), then 
a suitable 12 (one with Xj 2 j/ = and x i2 j^ = 1), and so on until ik-e, and similarly for 
hi ■ ■ ■ ih-t- 

Let r < k — l. Each time we try a row in order to find i r , we want one specific entry in 
it to be 1 and r — 1 others to be 0; the probability of this is 7r r = Pq~ 1 Pi-, independently 
of X 1 and what has happened earlier. If T r is the number of rows that we have to try 
until we find i r , then T r thus has a geometric distribution 

P(T r = t) = (1 -vt^'V, £ = 1, 2, . . . . 

This distribution has mean ET r = l/vr r and variance VarT r = (1 — 7r r )/ir^; hence the 
sum S := Ti + ■ ■ ■ + T^-i has mean 

k-e k-e k-e 

ES = ^ET r = Y J *; 1 = Y,Pi 1 Q r ~ 1 = 0(Q k ^) =0(n 1 - £ ) = o(n) 

r=l r=l r=l 

and variance 

fc— ^ fc— e 

Var 5 = ^ Varr r < ^vr r 7 2 = 0(Q 2 ^)) = 0{n 2 ^) = o(n 2 ). 

r=l r=l 

The search for i\, . . . ,ik-e succeeds if S < n/2. Consequently the probability of failure 
is, using Chebyshev's inequality, for n so large that E5 1 < n/4, 

/ _ \ VarS 1 VarS , , 

P(S > " /2) 5 (n/2 -Eg)' S (W = 
Hence, w.h.p. we succeed and find suitable rows ii, . . . ,ik—£', similarly w.h.p. we find 
also suitable columns ji, ■ ■ ■ ,jk-e, an d we can extend M' to a special triangular k x k 
matrix. □ 

Note that w.h.p. S is much less than n/2, so we have a wide margin in this proof and 
there are w.h.p. many different choices of rows and columns that work, and thus many 
different ways to extend M 1 to a special triangular matrix, cf. Remark 12. 2[ 
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